Overall Summary of Data After Error and Duplication Simulation (and before cleaning)

Total induced errors = 2508

Total induced duplications = 30310

Total induced complete duplications = 29892

Mean = 63.6137728

Median = 35.38

SD = 1135.6975017

Range = 0.00221, 120630

Number of observations = 255188

Number of individuals = 42803

Graph of data before GCO method

Overall Summary of GCO Method Performance (after data cleaning)

Mean = 41.3384617

Median = 35.3

SD = 30.8590055

Range = 0.57, 249.967

Number of observations = 235153

Number of individuals = 42803

Sensitivity = 56.3795853

Specificity = 99.9556752

Total deletions = 20035

Percentage deleted = 7.8510745

Complete duplications deletions (or step 1 of algorithm) = 18509

Non-complete (identical) duplications deletions (or step 2 of algorithm) = 281

Total modifications (or step 3 of algorithm) = 0

Add 100 modifications = 0

Add 1000 modifications = 0

Subtract 100 modifications = 0

Subtract 1000 modifications = 0

Multiply by 10 modifications = 0

Multiply by 100 modifications = 0

Multiply by 1000 modifications = 0

Divide by 10 modifications = 0

Divide by 100 modifications = 0

Divide by 1000 modifications = 0

Convert to kg modifications = 0

Convert to lbs modifications = 0

Transpose modifications = 0

Jumping measurements deletions (or step 4 of algorithm) = 0

Implausible measurements deletions (or step 5 of algorithm) = 0

Graph of data after GCO method